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This study shoMS that the test statistic for Analysis 
of Covariance (AKCOVA) has a noncentral F-districution with 
noncentrality parameter equal to zero if and only if the regression 
planes are homogeneous and/or the vector of oTerall covariate means 
is the null vector. The effect of heterogeneous regression slope 
parameters is to either increase or decrease the power of the E-test 
of AKCOVA* This depends on the relat^ionship of the true difference of 
the treatment effects of two group^ with respect to the dot product 
of the vector of overall covariate means and the vector of the 
difference of the two groups' slope parameters* (Author/RC) 
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General discussions of the misuse of the analysis of covarlance 
statistical procedure may be found In articles by Elashoff (1969) and 

CO 

O Glass, Peckham and Sanders (1972)* Two problems which are discussed 

^ In these references are the robustness and the power of the F-test 



A/ 



of the analysis of covarlance with the presence of heterogeneous 
regression slopes* Both of the problems may be addressed by application 
' of a K-sample regression linear model, 

A model equation for a one-way classification analysis of 
covarlance procedure Is 

= k - !y + \h !k ' 

k 

where Is the treatment effect (fixed) In group k, Pj^ Is the matrix 
of observed covarlates In group k, P Is the Ixp row vector of covarlate 
means taken over the K samples and k = 1, 2, , , , , K, 
Let Oj^ = U + Tj^ - P3j^» then 

!k " \\ ?k?k !k • . 
Is* f 

The analysis of covarlance procedure tests the hypothesis 
Hq: Tj^ = T2 = * * * ^ tj^ by testing the hyjpothesls Hj^: \ ~ ^2 ~ ' 



- ^ , . , J a^^^ - = 0, The hypotheses Hq and H^^ are 

* paper presented at the Annual Meeting of the American Educational 
Research Association^ April, 1976, 
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equivalent if &^ = ^ * * * 

Note also H and are equivalent If P = 0\ 

Equation 1 has the form of a K*sample regression model and 
H^: Tj^ = " * * * ■ Tj^ Is tested by testing the equality of the 
Intercept parameters Oj^ , k = 1, 2, , , , K, of the model. Thus the 
treatment effects are tested at the points where the regression planes 
defined by each group Intersect the criterion variable axis, 

Atlqullah (1964) used analytic methods to Investigate the effect 
of treatment slope Interaction In analysis of covarlance with one * 
covarlate. His Investigation showed that the e:cpected value of the 
difference of two estimated treatment effects 1^ biased when homogeneity 
of regression Is not tenable, 

Pecl^am's empirical study (1970), based on Atiqullah^s work, 
suggested that the F-test of the analysis of covarlance is increasingly 
conservative as the variance of the within group regression slope 
parameter Increased, His result differs from the one to be developed 
here» because his design chose the unequal regression lines to intersect 
at the dependent variable intercept, the point at which treatment 
effects are tested, 

Kocher (1974) and Hamilton (1974) Independently considered the 
same problem in the case when the covarlate was a random variable. In 
both studies, the computer-generated values of the covarlate yielded a 
population grand mean of zero for the covarlate^ thereby creating nearly 
the same situation as in Pacl^am^s study. 
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These three empirical studies found that» In general^ the probability 
of rejecting the true null hypothesis of equal treatment effects decreased 
when the separate group regression lines are not parallel. One should 
guard against over generalizing from the results of these Investigations 
for the following reasons; (1) The Investigators had problems with sim- 
ulating the linear model for analysis of covarlance. These problems 



Included unequal group error variances and non^-zero correlations between 
the covarlate and the error variable, (2) Each of the Investigators made 
certain assumptions about the linear model* The assumption that the grand 
mean of the covarlate Is zero has a definite effect on the outcome of this 
problem. And, (3) each study dealt with only one concomitant variable. 
Analytical Investigation of Robustness Using the K-Sample Regression 
Linear Model 

A K"sample regression linear model will allow an analytical study of 
the effect of heterogeneous regression planes to the analysis of covarlance 
procedure. 

The model equation of analysis of covarlance^ Equation 1 of the previous 
section, may be Incorporated Into the linear model 



y = Xe + e 



(2) 



where 



y 




^^l'^l*^2'^2* 




(3) 



e 



(^1 J J * * * 




and 



4 




4 
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X = DIAGfCl^^:P^) . Cl^^:y ..... (1^^:^] . 

1. e. X is a super diagonal matrix. 

The analysis of covariance hypothesis Hj^: " "2 " ^* "2 
. . . , a^_^ - = 0 may be written as H^i L'p » 0 where 



°3 ' °» 



1 0' -1 0' 0 
0 0' 10' -1 



0 0' 0 0' 
0 0' 0 0* 



0 0* 0 0* 0 ... 1 0* -1 0' 



(4) 



L* is a (K-l) X K(p+1) matrix of full row rank and 



L'p = 



\ ' Ih - \ 111 



(5) 



If Pj^ = ^2 " • • • ~ * 



and L'e = 0 if x^^ T2 " = . . . = Xj, , with the premise of homogeneity 
of regression. 

Suppose we rewrite Equation 5 as 
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o o 



o o 



and let = ^^^k+i " P^^) » 1= " 1» 2, . . . , K - 1. Define 

Y = (Yj^» Y2» • • • » Yjr_j^>'' Note that if P = 0', then y " 0. Also let 

A = [(t^ - » - T^) » . . . » (Tj,_j^ - Tj,)]', then L'p = A + y* 

Now if e; K(0, o^i ) then, the F-statistic for testing H, - 0 is 



SSE/(N-Kp-K) 



where SSE = y' tl-X(X'X) ^X'] y . F(H,) has non-central F-distribution 



with non-centrality parameter X = p'L [L* (X'X)~1l]~^ L*e/2o2 .denoted 



F(Hj^); F(K-1, K-Kp-K, X). 

Let A = tL*(X'X)"lL]"V2o2 , then A = (A + y) ' A(A + y) or 

X = A'AA + y'Ay + 2y'AA . (6) 

Suppose the treatment effects are equal, A = 0, then X = y'Ay • Since 
A is, a positive definite matrix X is greater than or equal to zero, and 
equals zero if and only if y 0 , i.e. ^en the regression planes are 

homogeneous or when P = 0*. 
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Therefore FCHj^) has 3 central F-distribution when the hypothesis 
Hj^ is true only if y = 0* In general when P ^ 0' » a consequence of 
violating the homogeneity of regressions assumption when using the analysis 
of covariance procedure is to increase the probability of a Type I error, 
rejecting when is true. This follows from the fact that the expected 
value of a non-central F^statistic is larger than that of a corresponding 
central F-statistic, That is, EtF(Hj^)] = n^Cl + 2A/n^)/(n2 - 2) when 
F(Hj^) is a non-central F^statistic with n^^ and degrees of freedom and 
non-centrality parameter A, If A = 0, then E[F(Hj^)] = n2/Cn2 ^ 2) (see 
Searle, 1971, p. 51), 

There is a shortcoming with the above approach which does affect the 
outcome to some extent. The model above is not the usual analysis of 
covariance model* The usual model estimates the within group slope 
parameters, by pooling the observed covariate data from all the treat- 
ment groups while the above model estimates the within group slope para- 
meters K times, using the observed covariate data within each treatment 
group . 

The model equation of a one-way classification analysis of covariance 
with the premise of homogeneity of regression, in the form of Equation 2, 
is 

y = X*3* + , 
x^ere y is as before, e* has the same distribution as 6* 

o ^ 

= C&, a*> a*, • • • , a*)^ , 

and 
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X* 



0 



0 
0 



K 



For this model the unbiased estlmatis of 
o2 = y' {I-X*CX*'X*)"1X*'] y/CN-K-p), while, for the model used In the 

previous development, the unbiased estimate of Is 

0^ = y' II-XCX'X)"^X'] y/(N-Kp-K) . 
Also the usual analysis of covarlance procedure estimates a* by 

Q* a - P^g while the model in this study estimates ol^ by 

The differences between the two models can only lead to the conclusion 
that the models are equivalent for large samples given reasonable values 
of K and P- Theoretically the two models are equivalent, given homogeneity 
of regression, as n^^ approaches infinity. 

However, the conclusions of an increased probability of committing 
a Type I error when homogeniety of regression is not tenable still stands. 
For this is an optimum result given a large sample and in statistics 
results are usually less than optimum when based on small samples. 
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Power of the Analysis of Covarlance Test of Hypothesis 

The model developed In the previous section may also be used to 

Investigate the power of the analysis of covarlance test of the hypothesis 

of equal treatment effects^ 

The power of a test Is the probability of rejecting the null hypothesis 

when It Is not true or the complement of the probability of committing a 

Type II error. 

The consideration of this topic, using the present approach, Is of 
little practical value since one usually uses a power function to determine 
a minimum sample size for which a prespeclfled practical difference between 
treatments would be judged statistically significant. Considering the 
comments at the end of the previous section, the model we shall use will 
yield optimum results for large samples only. But the Importance of this 
topic can be summed up In two sentences found In an article by Glass, 
Peckham and Sanders (1973, pp, 277-9), They wrote, "Further study Is 
needed to determine the effects of unequal slopes on the power of analysis 
of covarlance. This could very well be the crucial Issue," 

The null hypothesis of analysis of covarlance Is : L'& = 0 where 
S and L' are defined in the previous section by Equations 3 and 4 respectively. 
The test statistic of is 

F(H^) = ^ - ^ - 

SSE/(N-Kp-K) 

and F(Hj^): F(K-1, N-Kp-K) when H^^ is true. If H^^ is aot true, then 

F(H,): F'(K-1, N-Kp-K, A), where A = (L'S)' [L'(X'X)"1l]"1 a'B)/2a2 . 

9 

ERIC 



-9- 



Let P(II> be the probability of a Type II error occurring, then 
$(A) = I - P(II) is, by definition, the power of the test evaluated at 
the point X. We shall refer to $(A) as the power function. 

Now P(II) = Pr fF(Hj^) < Fj^_^(K-l, N-Kp-K) } when 
F(Hj^): F' (K-1, N-Kp-K, A) , or 

P(II) « Pr {F* (K-1, N-Kp-K, A) < Fj^_^'(K-l, N-Kp-K)} 
where a is the prescribed level of significance of the test. Thus 

e(A) = Pr {F* (K-1, N-Kp-K, A) > Fj^_^(K-l, N-Kp-K)} 
(see Searle, 1971, p. 126). 

From Equation 6 of the previous section 
A = A*AA + y*Ay + 2y*AA 
where A = <t^ - - . . . , j^^^ - j^)^ 

and , A = [L'(X*X)'1l]'V2o2 . 

If Y " 0 and A - 0, then A = 0 and 

e(A) = Pr {F*(K-1, N-Kp-K, 0) > Fj^_^(K-1, N-Kp-K)} 

or e(A) = a, and thus the test of the hypothesis : = 0 is a level 
a test. 

If Y = 0, then A A*AA and A > 0 if A 0. 
If A « 0» then A = y'Ay and A > 0 if y 0. 

When L i 0 and y i'^ 0» then A ^ 0 and A " 0 only when A = -Y since 

X ^ (A + y)"* A(A + y) and the matrix A is positive definite. Thus when 
^ *• ^ *• *• 

o o o o o o 

(tj^ - T2 - T3, . . . , Tj^_j^ - = '!^?2 " ?1' b ' ?2* ' ' ' * ?k " ?K-l'* 
the power function equals a. 

10 
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Thus to restate the conclusion of the previous section^ when y " 0, 
the power- function Is an Increasing function of A * &'AA and the minimum 
of the power function occurs at the point A ^ 0, However, when Y ^ 0, 
the minimum of the power function now occurs at the point for which 
A = (A + T')' A(A ^ y) =0, So that e(A) > a and equals a If and only If 
i ^ -Tf, Hence, barring the rare event of A ^ -Tf, the level of significance 
of Hj^ will always be greater than its prescribed value of a, when y ^ 0^ 

How the power function when y = 0 differs from the power function 
when y ° a ^ 0 depends on A, Let A- = A'M and A« = (A + a) ' A(A + a) , 
then ^2 " ^1 " 2a'AA and e(Aj^) < HJ^2^ ^ ^2' ^^^^ again 

that p(A2) > a since A2 - Aj^ > 0 when A = 0, Let f(A) = ^2 " ^1 
f(A) = a'A(a + 2A) , then . 

f(a) = 3a'Aa , 

f(-a/2) - 0 , 

f(0) = a'Aa , 
and f(-a) ^ -a'Aa , 

Thus 

e<Aj^) < e(A2) when A = a or A = 0 , 
e(Aj_) = ^(X^) when A = -a/2 , 
and e(Aj^) > ^(^2^ ^^^^ A *= -a , 

Unless K = 2, It Is difficult to determine the relationship between 
B(A^) and 3(A^) when A Is not a constant multiple of a. When K = 2, then 
A and a are scalars* 

The further development of this topic will only coiiSlder the situation 
when K 2. This does not substantially limit the Investigation, because 

11 
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even though 3 Is 3 sc3l3r, It still Is 3 function of p-cov3rlates 3nd 
now our concern C3n be considered 3S 3 multiple p3lrwl5e comp3rlson 
problem of tre3tment effects when K > 2, 

Now f(&) =^ aA(3 + 2&) Is 3 llne3r equ3tion In & 3nd for 
& < -3/2, f (&) < 0; for & > 3/2, f(&) > 0; 3nd when & = -3/2, f (&) » 0* 

With this lnform3tion 3 gr3ph of 3nd 6(^2) versus & C3n be 

dr3wn 3nd would h3ve the 3ppe3r3nce of the following flgure« 

^^^> = 1 




= a 



Figure 1 

B(X^) Is the power function when v " 0 
6(^2) is the power function when V = a 
The curves B(X^) and 0(^2) 3Symptotlc to the line 1 3nd 

3chleve their minimum 3t B(X) a, the level of slgnlflC3nce, 

_ o o 

ReC3lllng th3t t = ^ and a ^ - B^)^ If 

Tj^ - T2 < ^(1/2) P(62 ^ then the power of the F-test of the hypothesis 

o o 

Hq: ' T2 is reduced when ^ 62* power of the test Is lncre3Sed 

o o 

when ^ '^2 ^ "(1^2) P(g2 " under the S3me condition, 

ThuS| to summ3rl2e| In the gener3l C3Se when the gr3nd ine3n vector 
of the cov3rl3tes Is not zero, heterogeneous regression slopes lncre3Se 
the prob3blllty of 3 Type I error, rejecting Hq when Hq Is true, 3nd 
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either Increases or decreases the power of the F-test of equal treatment 
effects depending upon the relationship of (x^^ - with respect to 

o o 

PC62 * For a sufficiently large difference of (t^ - t^) the power 

_ o o 

function is close to one, regardless of the value of - 

This result is limited to the case of a large sample experiment 
with p-covariates and where Interest would be in pairwlse comparisons 
of the treatment effects when the number of treatment groups are greater 
than two« 
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